Fast Convegence Clustering Ensemble
نویسندگان
چکیده
Clustering ensemble combines some clustering outputs to obtain better results. High robustness, accuracy and stability are the most important characteristics of clustering ensembles. Previous clustering ensembles usually use k-means to generate ensemble members. The main problem of k-means is initial samples which have high effect on final results. Refining initial samples of kmeans increases the complexity of algorithm significantly. In this paper we try to predict initial samples, especially for clustering ensemble, without any increasing in time complexity. In this paper we introduce two approaches to select the initial samples of k-means intelligently to generate ensemble members. The proposed methods increase both accuracy and the speed of convergence without any increasing in time complexity. Selecting one sample from each cluster of previous result and selecting k samples which have minimum similarity to each other from coassociation matrix are the two proposed method in refining initial samples of k-means. Clarity, simplicity, fast convergence and higher accuracy are the most important parameters of proposed algorithm. Experimental results demonstrate the effect of proposed algorithm in convergence and accuracy of common datasets.
منابع مشابه
A Hybrid Framework for Building an Efficient Incremental Intrusion Detection System
In this paper, a boosting-based incremental hybrid intrusion detection system is introduced. This system combines incremental misuse detection and incremental anomaly detection. We use boosting ensemble of weak classifiers to implement misuse intrusion detection system. It can identify new classes types of intrusions that do not exist in the training dataset for incremental misuse detection. As...
متن کاملA new ensemble clustering method based on fuzzy cmeans clustering while maintaining diversity in ensemble
An ensemble clustering has been considered as one of the research approaches in data mining, pattern recognition, machine learning and artificial intelligence over the last decade. In clustering, the combination first produces several bases clustering, and then, for their aggregation, a function is used to create a final cluster that is as similar as possible to all the cluster bundles. The inp...
متن کاملThe ensemble clustering with maximize diversity using evolutionary optimization algorithms
Data clustering is one of the main steps in data mining, which is responsible for exploring hidden patterns in non-tagged data. Due to the complexity of the problem and the weakness of the basic clustering methods, most studies today are guided by clustering ensemble methods. Diversity in primary results is one of the most important factors that can affect the quality of the final results. Also...
متن کاملHigh-Dimensional Unsupervised Active Learning Method
In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...
متن کاملWeighted Ensemble Clustering for Increasing the Accuracy of the Final Clustering
Clustering algorithms are highly dependent on different factors such as the number of clusters, the specific clustering algorithm, and the used distance measure. Inspired from ensemble classification, one approach to reduce the effect of these factors on the final clustering is ensemble clustering. Since weighting the base classifiers has been a successful idea in ensemble classification, in th...
متن کامل